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Abstract 

The structure of liquid N-methylformamide (NMF) has been investigated using 
synchrotron radiation at 77 and 95 keV. The use of high energy photons has several 
advantages, in this case especially the large accessible momentum transfer range, the 
low absorption and the direct comparability with neutron diffraction. The range of 
momentum transfer covered is 0.6 A -1 < Q <24.0 A -1 . Neutron diffraction data on 
the same sample in the same momentum transfer range have been published previously. 
In that study two differently isotope - substituted species were investigated. In order to 
compare neutron and photon diffraction data properly Reverse Monte Carlo (RMC-) 
simulations have been performed. Some modifications had to be added to the standard 
RMC- code introducing different constraints for inter- and intramolecular distances 
as these distances partly overlap in liquid NMF. RMC- simulations having only the 
neutron data as input were carried out in order to test the quality of the X-ray data. 
The photon structure factor calculated from the RMC- configurations is found to agree 
well with the present experimental data, while it deviates considerably from earlier X- 
ray work using low energy photons (17 keV). Finally we discuss whether the different 
interaction mechanisms of neutrons and photons can be used to directly access the 
electronic structure in the liquid. Evidence is presented that the elastic self scattering 
part of liquid NMF is changed with respect to the independent atom approximation. 
This modification can be accounted for by a simple charged atoms model. 



1 Introduction 

Liquid N-methylformamide (NMF) OHC'-NH(CH3) is the simplest molecule containing the 
peptide bonding system 0=C'-N- relevant for the structure of proteins. It has a very high 
dielectric constant (167.1 at 35 °C as compared to 74.8 for water). This fact has been 
attributed to the ability of forming hydrogen bonded chains aligning the molecular dipoles 
since the early measurements of Cole et al. 1964 |l]]. The peptide system is believed to 
be essentially flat, having a barrier to internal rotation around the N-C' bond due to the 
delocalization of the 0=C- N- 7T- bonding system The barrier to internal rotation gives 
rise to two different isomers: cis- and trans- NMF (trans is defined to be the isomer with 
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the oxygen and the amide hydrogen on different sides of the N— C- bond). The trans-isomer 
is the more stable one being present at room temperature to about 90-95% in the liquid. 

These interesting properties have lead in the past to two liquid structure determinations 
in spite of the relative complexity of the molecule giving rise to at least 28 independent 
partial structure factors, when treating the methyl protons as equivalent. First Ohtaki et 
al. H used conventional X-rays (Mo-K Q : 17.4 keV) for the structure determination and 
interpreted their data in terms of chains of trans-NMF molecules. Next, neutron diffraction 
was done on two different isotope-substituted species, one being fully deuterated (NMF-d5), 
the other containing one hydrogen nucleus in the amide position: ODC-NH(CD3) (NMF-d4) 
pj (this reference henceforth referred to as I) . The neutron data will be used in the present 
paper to check the quality of the high energy photon data as well as an additional source of 
information for the RMC simulations. 

Surprisingly, the crystal structure of NMF has been determined only recently Q , whereas 
there is a couple of crystal structure determinations available for similar molecules such as 
formamide || [?], || , acetamide 0, [l(]] and N-methylacetamide Jllj . In the crystal NMF 
forms helical chains of hydrogen bonded trans-molecules. The molecular structure in the 
gas phase on the other hand has been determined by electron diffraction [ |l2"| , microwave 
spectroscopy |R| and several quantum mechanical calculations |l4|, [15, 16, 1?J. As a result 
the C'=0 bond is found to be substantially shorter in the gas phase then in the solid (and 
also shorter than in the liquid, as stated in I) indicating a higher double bond character, 
whereas the C'-N bond is longer. 

The interpretation of X-ray diffraction experiments on liquids is typically done in the 
independent atom approximation neglecting the redistribution of electrons on molecule for- 
mation. This fact has been used by 'neutron diffractionists' to argue that photon diffraction 
has a lower inherent precision than neutron diffraction. Thinking positive one can hope 
to learn something about the electronic structure in the liquid by comparing neutrons and 
photons. However, the effect of redistribution of the valence electrons is very difficult to 
observe in a liquid as it is small. Its observation therefore requires a high precision on both 
the neutron and the photon side of the experiment. A further difficulty is the fact, that the 
absorption cross sections for conventional X-rays (8 to about 20 keV) in general are much 
larger than cross sections for neutrons, so that photon diffraction has to be done under 
conditions quite different from a typical neutron diffraction experiment (reflection geome- 
try or very thin samples instead of the large samples used in neutron diffraction) . As the 
photoelectric absorption cross section approximately decreases with E 3 this difficulty can 
be circumvented by using high energy photons (~ 100 keV). The conditions for performing 
such experiments in a reasonable time have been established recently [pl|. NMF is a very 
promising object for observing deviations from the independent atom approximations since 
75% of its electrons are valence electrons and the polarization of the bonds is expected to 
be high. 



2 Experimental Set-up 

The experiment was performed on the Hard X-ray triple axis diffractometer recently build 
at HASYLAB [jl9| ^TJ. The diffractometer was provisionally installed on beamline BW7 
at DORIS-III, a conventional wiggler beamline not optimized for high energy use. At the 
time of the experiment electrons of 4.45 GeV were stored in DORIS-III. Slightly different 
set-ups were used for the low- and high-Q part of the spectra, one covering the range from 
0.6-14.6 A -1 at an incident photon energy of 76.8(8) keV, the other from 8-24 A" 1 at 95.5(9) 
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keV. The energies were determined by calibrations of a multi channel analyser with a 133 Ba 
radioactive source and from the powder lines of an (external) aluminium standard. The small 
scattering angles at high energies allow for a tangential move of the cooled Ge solid state 
detector. No provision was made to exclude the air. The beamline contains no focussing 
parts. The spectrometer was operated in a two- axis mode and no separation of inelastic 
(Compton-) scattering was attempted. The data acquisition was carried out in several scans 
in order to minimize errors caused by beam fluctuations and drift in general. The liquid 
NMF - vacuum distilled shortly before the experiment - was contained in a cylindrical glass 
tube of 3 mm diameter and 10 /im wall thickness. The present set-up leads to count-rates 
from 10000-500 s -1 . 100000- 400000 counts were accumulated at 670 points in about 20 
h. In general, the conditions were quite similar to the previously described experiment on 
glassy Si0 2 g|. 

3 Data correction and error estimation 

The raw data - after summing up the individual scans - are shown in figure @. These 
data were subsequently corrected for background (mainly scattering from air and sample 
container), absorption, multiple scattering, polarization of the incoming beam, the variation 
of the solid angle seen by the detector as a result of the changing sample to detector distance 
(tangential movement, see above) and the increased detection efficiency for inelastically 
scattered photons, /ir is 0.027 at 76.8 keV (with n being the total absorption coefficient and 
r the radius of the sample) and hence both absorption and multiple scattering corrections are 
very small. Air scattering is the dominant background contribution, especially at low Q's. 
The main error sources in this study are therefore believed to be the instabilities induced by 
a different response of the main detector and the monitoring system to the synchrotron beam 
fluctuations, the knowledge of the degree of linear polarization of the monochromatic beam 
(assumed to be known within 1%), the distance sample to detector (equally known approx. 
within 1%) and the energy calibration. The mean energy shift of the inelastically scattered 
photons at Q=23 A -1 is 1.6 keV at 95.5 keV. The effect of an increasing detection efficiency 
for those photons is estimated to be a 1% effect. For a more detailed description of the 
data correction procedure the reader is referred to |L8) . The fully corrected and normalized 
intensity is presented in figure ||. 

4 Simulation details 

In RMC- simulations ]|]J a suitable number of atomic or molecular entities is placed in a 
cubic (or differently shaped) simulation box. A Markov- chain is generated accepting moves 
provided the move is improving the agreement with experiment, or if a random number in 
the range [0 : 1] is less than £i P ex P(- A X?/2of ) where the sum extends over all Nexp 
independent experiments. Otherwise the move is rejected. A% 2 is the difference between 
the x 2 °f the old and the new configuration, where \\ is defined here as: 

w pointa 

X 2 = i S (Q)conf,i ~ S{Q) exp . l } 2 (1) 

where the sum extends over all iVpoints points in Q-space , S(Q) is defined by Eqs. || and 
|J erf is a measure of the assumed error in data set i, the subscript conf refers to the 
S(Q) calculated from the configuration generated and the subscript exp to the experiment. 
RMC simulations in polyatomic molecular systems are not as straightforward as in atomic 



3 



systems. It is accepted, that the molecules should not be treated as rigid because the 
molecular part of the scattered intensity is then reproduced poorly. If on the other hand 
the atoms are treated as free one has to introduce the information that the atoms are 
connected and form a molecule. The approach used by Radnai et al. ps| based entirely on 
coordination constraints was found to be unsuitable in our case, since in NMF inter- and 
intramolecular distances overlap. As a consequence two problems arise: a% and a-i are the 
distances between which a next neighbour of type B is expected around an atom of type A 
within the molecule ( so that a\ < tab (intramolecular) < a-i) and a 3 is the smallest distance 
at which a molecule of type B on a different molecule is expected around A. In the standard 
RMC code no distinction is made between inter- and intramolecular distances, thus even 
if a\ < a,2 < 03 error messages occur as the intramolecular distances between a\ and ai 
violate the constraint that no AB distance should be present below a$ (hard core radius). 
This is still not disadvantageous, but if a\ < a 3 < a 2 the RMC code tries to satisfy both 
the coordination constraint between a\ and ai and the hard core radius 03. As a result 
all intramolecular distances are now comprised between 03 and 0,2 what is not intended. If 
finally 03 < a± < a-i there are real mtermolecular distances between a\ and 0,2 and hence 
the total coordination in the range \a\ : ai\ is no longer known. The only way to treat this 
problem is to fix explicitly which atoms belong to the same molecule and to give different 
coordination constraints and cut-off radii for inter- and intramolecular distances. This can 
not be done with the standard RMC, version III codc[f23f. 

In the present simulation a 9 site model with 216 molecules corresponding to 1744 atoms 
was used. The distance of closest approach was fixed to 2.0 A for all intermolecular distances 
except the O- • -H-N were it was 1.5 A. The molecular density was taken to be 0.01022 A -3 
corresponding to a half box length of 13.85 A. The intramolecular constraints are given in 
Table I and are chosen to give essentially planar trans-molecules, where the methyl protons 
are free to rotate around the N-C bond. The range in which the intramolecular distances 
can fluctuate is chosen to be four times the Debye- Waller factor as given by Kitano et al. 




5 Analysis 

In order to assess the quality of the high energy photon data a RMC simulation (modified as 
described above) has been carried out using only the neutron data from I as input. From the 
configurations generated we calculated the corresponding photon scattering. We compare 
the result to our experimental data as well as to the data of Ohtaki et al. j|] obtained with 
Mo-K Q X-rays. 

The RMC fit to the two neutron data sets and its first order difference is shown in figure 
[| in Q-space. The definitions of the functions shown are: 

chr_\ 

S(Q) = ff 2 (2) 



,(r) = 1 + / ^ QS(Q)sm(Qr)dQ, 

z7r pr Jq 



where (da / dil) dist is the distinct differential cross section, the sum is extended to all Nm 
atoms in the molecule, hi is the neutron scattering length of atom i, p is the macroscopic 
number density, Q max is the largest accessible momentum transfer where S(Q) = 0, S(Q) 
is the neutron weighted structure factor and gn{f) is the neutron weighted pair distribution 



4 



function. The agreement between simulation and experiment is good within the statistics 
given by the number of sites and the chosen a- parameter for the total structure factors, 
but not perfect for the hydrogen first order difference in the low-Q region. The first order 
difference is given here by: 

„ c EjV M (d4) h i) 2 „ , Q s 

Note, however, that the first order difference has been enlarged by a factor of 5. The fit is 
considered to be good enough for our purposes {yfx 2 /^points defined by Eq. [l] is 0.0032 for 
NMF-d5 and 0.0043 for NMF-d4) . 

The next question to ask is whether or not the configurations consistent with the neutron 
data are also consistent with the photon diffraction data. With definitions analogous to those 
appearing in Eq. 2: 

da 



i(Q) = 7#^ff| (4) 



g x (r) = 1 + -i— [ Qmal Qi(Q) sin(Qr)dQ 

the comparison RMC (neutrons) versus experiment (photons) is done in Fig. || and |^. i(Q) 
is here the photon weighted structure factor, g x (r) the photon weighted pair distribution 
function and fa the form factor of the i th atom. 

Focussing first on the real space comparison two things should be noted: First, g x (r) is 
not equal to zero at r < r cut for the Fourier- transform of the RMC S(Q) - where r cut is 
the smallest internuclear distance. This is known to be caused by the fact that the terms 
fifj/(J2f) 2 are n °t constant in Q. Second, there is a much larger deviation from zero 
for the Fourier-transform of the experimental S(Q). It is current practice to back Fourier- 
transform the deviation of g x {r) from zero at low r to get an idea about the size and the 
location of systematic errors in Q-space |24] However, this approach is known to be incorrect, 
as g x {r) may deviate from zero even for data completely free from systematic errors. We 
have therefore chosen to use the following 'correction'-function instead: 

K(Q) = C£V 2 ^ I ^[gxiexp) -g x (RMC)} S in(Qr)dQ (5) 

K(Q) will be shown to be largely related to deviations from the independent atom approx- 
imation. 

The form factor of a spherically symmetric atom is given by p5j : 



z ^ c 

fiaa(Q) = y>7T/Q / 



ipi(r)r sin(Qr)dr (6) 



where Jiaa is the form- factor of the unperturbed atom, Z is the number of electrons within 
that atom and ipi is the wave-function of the i th electron. Using Roothaan-Hartree-Fock 
(RHF) atomic wave-functions as tabulated by Clementi et al. |^6| the above integral can be 
solved analytically. 

Now leaving the independent atom approximation, a very simple approximation of atoms 
in molecules is the assumption that the electron cloud of the atoms keep its form and 
spherical symmetry but that the atoms can adopt net charges [E7j. As a consequence the 
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valence shell of the respective atom is contracted or expanded, while the core electrons 
remain unaffected: 

i>valence( r ) = *Pvalence( Kr ) CO 

The simplicity of this approach keeps the integrals of Eq. [| to be solved analytically. The 
calculations are presented in the appendix. A comparison between fi)f aa — fi)cm 
and K(Q) is done in Fig. || (iaa for independent atom approximation and cm for charged 
atoms model). The charges of the atoms have been taken from formamide in the solid state 
H and it is assumed that the methyl group takes the charge of one amide hydrogen. The 
charge parameters are given in Table |. Using now the modified form-factors fi tC m in the 
analysis the low r deviations of g x (r) are very much reduced as shown in Fig. ||, except for 
the peak at 0.3 A. Given the crude model used, this result is very satisfying. One can ask 
now whether the peak at 0.3 A also has a physical meaning or is caused by experimental 
errors, but further analysis seems to be speculative without supporting quantum-mechanical 
calculations. 

In Fig. ^ i{Q) from our experiment and the one from Ohtaki's experiment || are com- 
pared to the RMC result (with the neutron measurements as input only) . The discrepancies 
of Ohtaki's data in the first diffraction peak are immediately evident. This is rather aston- 
ishing as in the region Q > 3A - dominated by intramolecular distances - the agreement 
between Ohtaki's data and the simulation is very good, even better than the agreement 
achieved in the present study. It is noted that Ohtaki's data do not extrapolate to the 
correct low Q limit given by the compressibility. On the other hand the agreement between 
the high energy photon result and the shape of the first maximum predicted by the neutron 
result is nearly perfect. Concluding, there seems to be a larger problem in Ohtaki's data 
analysis and the conclusions drawn in his paper should therefore be accepted with care. 

Including the hard X-ray data in the RMC simulations, does not remove the slight 
discrepancies between simulation and experiment in the range from 3-8 A -1 still present 
in Fig. [?]. From another point of view - these discrepancies are not of a type which could 
be removed by simply moving atoms in the box within the given constraints. Hence, as 
stated above, these discrepancies are caused either by systematic errors still present or by 
deviations form the independent atom approximation not accounted for by the primitive 
model used. This fact also highlights, that the x-ray diffraction pattern is predicted by the 
neutron diffraction data, and the x-ray data are in excellent agreement with the neutron 
data. 

Likewise, introduction of 10% cis- molecules does not substantially improve the fit. Nev- 
ertheless, the configurations generated in this way might be closer to reality, as it is known 
that a certain percentage of cis-molecules are present in the liquid. The cis/trans isomerism 
only changes rather long intramolecular distances - smeared by large amplitude vibrational 
motions - which can be modelled by intermolecular distances without coming into conflict 
with the imposed constraints. 

Finally, one aim of the present study was to test the conclusion drawn in I, that the 
molecular structure of liquid NMF is more like in the solid and not like the gas phase struc- 
ture, hence, answering the question, whether long range order is a necessary condition for 
the found increased derealization of the 7r-electron bonding system. The r-space resolution 
should be sufficient to resolve the C'-N and the C'=0 distances if the gas- phase model of 
the molecule is valid and at high Q the respective interference pattern should be observed. 
Different from the neutron case these distances are not significantly obscured by hydrogen- 
heavy atom distances. This is illustrated in Fig. |l but it is also apparent from the figure 
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that the counting statistics reached at high Q-values so far is not sufficient to support the 
finding from I. The beam fluctuations mentioned in chapter 3 also contribute to the uncer- 
tainty in S(Q). In this respect, the data quality will probably improve in the near future, 
especially with the introduction of an area detector. It is also noted, that the Q-range up 
to about 14 A" 1 covered by conventional X-rays is not very conclusive for answering the 
question posed. 

6 Conclusion 

The structure of liquid NMF has been investigated using the recently reported methods 
for high energy photon diffraction. The structure factor derived is shown to be in much 
better agreement with the information available from neutron diffraction measurements 
than the earlier study of Ohtaki et al. using low energy photons (conventional Mo-K Q - 
radiation). In the intermediate Q-range from about 3-8 A -1 some discrepancies between the 
experimental self- scattering intensity and the self-scattering predicted by the independent 
atom approximation are observed. These discrepancies are accounted for by a simple charged 
atoms model, based on charge parameters from solid state formamide. The introduction of 
charged atoms largely improves the low-r behavior of gxir). Although promising for testing 
a finding from neutron diffraction - that the intramolecular structure of liquid NMF is closer 
to the solid state structure than to the structure of the gas phase - this aim could not be 
reached as the counting statistics were not good enough at high Q values. 

High energy photon diffraction is likely to see considerable progress and will be a useful 
tool for understanding the physics of systems without long-range order. Its most outstanding 
property will be the ease with which information from neutron and photon diffraction can 
be combined. 

7 Appendix 

In this appendix the solution of the integrals appearing in Eq. |6| will be sketched, first in 
the independent atom approximation using RHF wave-functions, then in the charged atoms 
model given by Eq. || RHF wave-functions are of the form: 

p 

where C p are the expansion coefficients of the basis functions \ which are of the Slater type: 

Xp \ a (r, 8, <P) = R pXa (r)Y Xa [9, 0) (9) 

with 

R p (r) = const. (^ p ) r B-1 e _ ^ r (10) 

where n is the quantum number of the basis function, A indicates the symmetry species, 
a the subspecies of the electron wave-function and £, p \ a the orbital exponent (The angular 
dependence of the 2p-type orbitals averages out). Inserting the RHF wave- functions into 
Eq. ^| creates integrals of the type: 

1 f°° 

-const. / r ni+ " 2_1 e- ( * 1+&)r sin(Qr)dr (11) 

Q Jo 
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which can be solved analytically [psf . With e. g. ni = 1 and ri2 = 2 the above integral 
equals to: 




-£i- 5 £ 2 2 - 5 [(6 +6) 2 + Q 2 ]- 1 - 5 sin(3arctan[Q/(ei +6)]) (12) 



When modifying the wave-functions as in Eq. [7] the term in Eq. [12] e. g. has to be modified 
to: 



It should be noted that the Q — > limit of f 2 m l ne charged atoms model is no longer 
Zf but ^2i{Zi + qi) 2 , where Zi is the nuclear charge and qi are the partial charges of the 
atoms, qi is related to k by: 



where N va \ cncc is the number of valence electrons in the independent atom and N COTe is the 
number of core electrons. 
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Figure 1: The high energy X-ray raw data 
From top to bottom: scattering from NMF in a glass tube, from the empty glass tube, and 
background scattering (mainly air scattering). The high-Q data are divided by 10 for better 
visibility. The ratio counts/monitor counts is larger for the high-Q scans as the distance 
sample/detector is shorter and hence the sampled solid angle larger. 



Figure 2: The fully corrected and normalized data 
Full line: isotropic part of the scattering intensity (Self scattering + Compton part); broken 
line: experimental intensity. Note that the slightly higher level of the isotropic part in the 
high-Q region is caused by the higher energy (lower relativistic correction) of the photons 
used. Insert: Zoom into the region Q > 6 A -1 . 



Figure 3: RMC- fit to the neutron data 
From top to bottom: Fit to the NMF-d5 data, fit to the NMF-d4 data, the resulting hydrogen 
first order difference. Full line: RMC simulation, broken line: experimental data from I. 
The first order difference is enhanced by a factor of 5. 



Figure 4: Comparison RMC (neutron data) versus experiment (high energy photons) 
Full line: high energy photons, self scattering in the independent atom approximation, 
broken line: high energy photons, self scattering from the charged atoms model (see text), 
points: RMC simulation, neutron data as input only. 



Figure 5: Comparison RMC (neutron data) versus experiment (high energy photons) 
Same as Fig. |[ but in real space. 



Figure 6: Comparison K(Q) versus [Q] fiaa) 2 - XX/cm) 2 ] 
Full line: K(Q) is a 'correction'- function defined in the text, broken line: difference between 
the self-scattering calculated from the independent atom approximation and the charged 
atoms model. 



Figure 7: Comparison of low energy photons with high energy photons and neutrons 
Full line (with diamonds): low energy photons (from Ohtaki et al. [||, broken line (longer 
lines): RMC simulations, neutron data as input only, broken line (shorter lines): high energy 
photons (this work), self scattering in the charged atoms approximation, dots: high energy 
photons (this work), self scattering in the independent atom approximation. 



Figure 8: Comparison gas-phase model versus solid state model of the intramolecular struc- 
ture 

Diamonds: experimental points, solid line: intramolecular i(Q) based on the gas phase model 
with CO = 1.22 A and C'N = 1.35 A, broken line: intramolecular i(Q) based on the solid 
state model with CO = 1.26 A and C'N = 1.29 A, points: smoothed experimental points. 
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Tabic 1: Constraints used for the RMC simulations 



N-H 

C 

c 
o 

C'-H 
C-H 



N 



0.86 
1.21 
1.36 
2.17 
1.79 
1.89 



1.15 
1.38 
1.56 
2.38 
2.18 
2.30 



N-H 

1.78 2.16 

1.92 2.33 

2.97 3.33 

1.86 2.48 



C 

1.78 2.16 

2.29 2.58 
1.19 1.35 
0.94 1.26 



C 



1.92 
2.29 



2.33 
2.58 



2.64 3.09 
3.10 3.51 
0.91 1.23 



O 

2.97 3.33 
1.19 1.35 
2.64 3.09 

1.89 2.25 



C'-H 
T86 2+8~ 
0.94 1.26 
3.10 3.51 
1.89 2.25 



C-H 



0.91 1.23 



1.48 2.00 



All distances in A. Left number lower limit, right number upper limit for the respective intramolec- 
ular distance. 



Table 2: Charge parameters for the charged atoms model (cm) 





N 


N-H 


C 


C 


O 


C'-H 


C-H 


K 


0.9567 


1.1856 


1.0465 


1.0093 


0.9712 


1.0102 


1.0357 


ft 


-0.78 


+0.40 


+0.51 


+0.10 


-0.55 


+0.03 


+0.10 
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